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ABSTRACT 

This  paper  outlines  some  of  the  basics  of  Dempster-Shafer  Theory,  which  is  a  mathematical 
theory  for  combining  evidence  from  different  sources  to  obtain  a  degree  of  belief  in  a 
proposition.  In  particular,  different  combination  rules  available  within  the  context  of 
Identification  Fusion  and  the  assumptions  and  implications  of  each  of  those  rules  are 
outlined  and  investigated.  However,  the  belief  function  arising  from  combining  evidence 
under  Dempster-Shafer  Theory  is  often  insufficient  to  support  decision-making,  and  a 
transformation  from  the  belief  function  to  a  probability  distribution  is  required.  Several 
different  transformations  and  illustrative  examples  implementing  Dempster-Shafer  Theory 
for  Identification  Fusion  are  provided.  The  results  and  some  possible  directions  for  future 
work  are  discussed. 
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A  Discussion  of  Dempster-Shafer  Theory  and  its 
Application  to  Identification  Fusion 


Executive  Summary 

Sensor  Fusion  is  the  combination  of  sensor  data  obtained  from  different  sources  to  reduce 
uncertainty  and/or  obtain  more  complete  or  accurate  information  than  that  available  from 
any  individual  source.  In  some  cases,  two  or  more  sensors  with  complementary  capabilities 
may  be  used  together  to  obtain  a  target  estimate  that  is  better  than  when  any  individual 
sensor  is  used  alone. 

The  scenario  addressed  by  this  paper  is  that  of  fusing  the  data  from  two  or  more  sen¬ 
sors  to  support  an  operator’s  decision  making  ability  about  the  identity  of  a  target.  This 
paper  outlines  some  of  the  basics  of  Dempster-Shafer  Theory  (DST),  which  is  a  mathe¬ 
matical  theory  for  combining  evidence  from  different  sources  to  obtain  a  degree  of  belief 
in  a  proposition.  It  overcomes  the  Bayesian  constraint  of  needing  precise  fixed  priors  by 
assigning  mass  values  which  may  or  may  not  obey  the  classical  probability  axioms.  This 
means  DST  is  able  to  accommodate  uncertain  knowledge  in  a  manner  which  proponents 
believe  Bayesian  probability  cannot  do,  or  at  the  very  least  is  unable  to  in  an  obvious  and 
straight-forward  manner. 

The  aim  of  this  paper  is  to  describe  the  fusion  of  target  identification  data  within  the  DST 
framework.  Dempster’s  rule  of  combination  is  the  most  common  DST  approach  for  fusing 
data.  Alternative  rules  of  combination  include  Smets’  Transferable  Belief  Model  (TBM), 
Yager’s  Rule,  Inagaki’s  Unified  Combination  Rule  (UCR),  and  Dubois  and  Prade’s  rule. 
Each  of  the  different  rules  of  combination  yields  a  distribution  of  belief  or  mass  and  the 
distribution  may  be  different  depending  on  the  approach  adopted.  An  examination  of 
different  rules  of  combination  is  presented  and  their  underlying  assumptions  described. 
Simple  examples  are  used  to  illustrate  how  the  different  rules  of  combination  are  im¬ 
plemented  and  to  highlight  the  treatment  of  uncertainty  in  the  sensor  data.  However, 
typically,  evidence  that  is  presented  in  the  form  of  a  distribution  of  belief  does  not  directly 
enable  decisions  to  be  made.  Alternative  transformations  from  a  belief  to  a  probability 
distribution  are  described  and  illustrative  examples  presented. 
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1  Introduction 

Sensor  Fusion  is  the  combination  of  sensor  data  obtained  from  different  sources  to  reduce 
uncertainty  and/or  obtain  more  complete  or  accurate  information  than  that  available  from 
any  individual  source.  In  some  cases,  two  or  more  sensors  with  complementary  capabilities 
may  be  used  together  to  obtain  a  target  estimate  that  is  better  than  when  any  individual 
sensor  is  used  alone.  The  scenario  addressed  by  this  paper  is  that  of  fusing  the  data  from 
two  or  more  sensors  to  support  an  operator’s  decision  making  ability  about  the  identity 
of  a  target. 

The  most  popular  methods  for  combining  information  from  multiple  sensors  are  Bayesian 
in  nature,  including  Bayesian  networks  and  Kalman  filters.  Bayesian  networks  are  models 
that  represent  variables  and  their  related  probabilities  and  are  used  to  perform  inference 
based  on  probability  distributions  provided  by  the  sensor  or  other  data  sources  [1].  Halpert 
extended  target  identification  (ID)  used  Bayesian  techniques  for  more  than  two  targets  [2], 
however  Mortiss  noted  that  this  model  does  not  work  well  for  identifying  neutral  targets 
unless  information  on  the  nature  and  number  of  these  targets  is  available  prior  to  en¬ 
gagement  [3].  Krieg  showed  how  Bayesian  Belief  Networks  could  be  used  as  a  framework 
for  linking  target  kinematics  with  attribute  states  [4],  whilst  a  comprehensive  tutorial  in 
Bayesian  Belief  Networks  was  also  given  by  the  same  author  [5].  Masked  [6]  addressed  a 
well-known  problem  of  handling  conflicting  information  by  setting  appropriate  values  for 
the  probability  distributions.  A  key  requirement  for  all  Bayesian  techniques  is  that  the 
prior  probability  distributions  must  be  specified  in  advance. 

Dempster-Shafer  Theory  (DST)  overcomes  the  Bayesian  constraint  of  needing  precise  fixed 
priors  by  assigning  mass  values  which  may  or  may  not  obey  the  classical  probability 
axioms.  This  means  DST  is  able  to  accommodate  uncertain  knowledge  in  a  manner  which 
proponents  believe  Bayesian  probability  cannot,  or  at  the  very  least  is  unable  to  do  in  an 
obvious  and  straight-forward  manner. 

As  an  example,  suppose  that  p{ai)  and  p{a2)  denote  the  probability  of  there  being  rain 
in  a  day  or  not  respectively.  Then  clearly,  p{ai)+p{a2)=l-  However,  if  we  let  m{ai) 
and  m{a2)  depict  the  masses  for  there  being  rain  or  not,  then  under  the  DST  framework 
m(Q!i)  -|-  m{a2)  +  m{ai  U  a2)  =  1,  where  m(ai  U  a2)  denotes  the  uncertainty  mass  of  not 
knowing  whether  it  will  rain  or  not.  Note  that  if  m(Q;i  U  a2)  =  0,  then  this  is  no  more 
than  the  classical  probability  axiom,  but  this  is  not  a  requirement  in  DST.  However,  the 
computational  complexity  of  DST  can  quickly  escalate  and  while  Bayesian  probability  is 
universally  accepted,  the  same  cannot  be  said  of  DST;  partly  due  to  it  being  relatively 
new  (only  a  few  decades  old).  Further,  the  concept  of  belief  masses  and  what  this  means 
is  still  contested  by  some  statisticians  and  mathematicians  working  in  this  area. 

In  effect,  Dempster-Shafer  Theory  (DST)  is  another  method  for  combining  information 
from  multiple  sensors  [7,  8,  9]  and  may  be  regarded  as  a  collection  of  different  theories 
with  the  same  underlying  uncertainty  calculus  and  has  its  origins  in  the  representation  of 
and  reasoning  under  partial  probability  distributions  over  a  finite  set  of  elements,  called 
0.  It  encodes  uncertainty  by  assigning  normalised  non- negative  quantities  (basic  belief 
masses)  to  the  elements  of  the  power  set  of  0  (which  are  of  order  2l®L  Notwithstanding 
the  fact  that  Random  Set  Theory  has  been  proposed  by  Mahler  as  a  unifying  framework 
between  different  types  of  fusion  rules  [10,  11],  the  main  benefits  of  DST  are  that  it 
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specifically  caters  to  the  handling  of  ambiguity  in  systems  that  provides  a  more  flexible 
means  of  representing  ignorance  than  probability  theory,  and  it  is  able  to  handle  random 
variables  with  uncertain  belief  masses.  Furthermore,  DST  has  also  recently  been  used 
with  other  techniques  [12,  13,  14,  15,  16]  for  such  purposes  as  rule-based  inferencing  and 
group  decision-making  among  others. 

In  this  paper,  particular  emphasis  is  placed  on  the  fusion  of  information  and  sensor  data 
from  multiple  sources  which  contains  uncertainty.  Bogler  [17]  detailed  a  use  of  Dempster’s 
rule  of  combination  using  data  from  sensors  such  as  a  Radar  Warning  Receiver  (RWR), 
a  Television  Sighting  Unit  (TVSU)  and  an  Infra-Red  Search  and  Track  (IRST)  sensor. 
DST  has  also  been  applied  to  multitarget  tracking  in  clutter  [18],  whilst  another  approach 
which  uses  DST  for  joint  tracking  and  classification/intent-detection  can  be  found  in  [19]. 
Also  recently,  there  has  been  some  work  done  in  trying  to  fuse  soft  data  [20,  21,  22].  This 
is  data  that  has  a  qualitative  nature  to  it  and  is  typically  used  in  every  day  speech,  for 
example  “that  car  is  near  the  city”  or  “that  person  is  walking  really  fast”.  Apart  from 
the  fact  that  fixed  priors  must  be  known,  or  at  best,  it  is  sometimes  argued  that  Bayesian 
methods  can  be  too  restrictive  a  fashion,  Bogler  also  notes  that  Bayesian  techniques  do 
not  explicity  address  the  problem  where  the  sensor  data  presents  conflicting  information. 
A  recent  survey  of  techniques  that  seek  the  address  the  problem  of  uncertainty  from 
a  theoretical  and  practical  perspective  is  provided  in  [23].  More  specifically,  Hadzagic  et 
ah  [24]  considered  the  relevance  and  reliability  in  assessing  evidence  for  the  implementation 
of  DST  to  Electronic  Support  Measures.  Koks  and  Challa  [25]  compared  Bayesian  fusion 
with  DST,  whilst  Smets,  as  will  be  discussed  later,  also  wrote  extensively  on  the  issue 
of  comparing  DST  techniques  with  other  methods  for  fusion,  including  specihcally  to  the 
problem  of  target  detection  and  ID. 

Other  recent  interesting  applications  of  DST  involve  the  behaviour  and  decision-making 
aspect  of  travellers  [26]  and  for  the  quantification  and  uncertainty  propagation  of  AIS  ship 
data  [27].  The  application  considered  in  this  paper  is  the  problem  of  determining  the  ID,  or 
identification,  for  a  target  of  interest  given  data  from  multiple  sensors.  In  this  context,  ID 
can  be  simply  described  as  the  task  of  determining  or  classifying  the  identity  of  unknown 
targets  of  interest.  Depending  on  the  specific  requirements  for  the  application,  the  target 
ID  may  refer  to  the  unique  platform  (e.g.  a  specific  ship  such  as  HMAS  Anzac:  FFH150), 
the  platform  type  (eg  Anzac  Class  Frigate),  the  platform  category  (eg  Frigate)  and/or 
the  platform  allegiance  (eg  Friend).  Data  about  the  ID  of  a  target  may  be  received  from 
heterogeneous  sources  including  imaging,  acoustic  and  RF  sensors  and  even  intelligence. 
The  source  data  may  relate  to  different  levels  of  the  target  ID  so  that  one  sensor  may  be 
able  to  determine  the  allegiance  while  another  may  be  able  to  determine  platform  type 
details. 

The  objective  of  sensor  fusion  is  to  estimate  the  states  of  targets  given  data  from  multiple 
sensors.  For  a  given  sensor,  the  target  ID  may  be  ambiguous,  for  example  a  sensor  may 
not  be  able  to  resolve  between  several  possible  platform  types.  For  many  applications, 
the  target  state  comprises  kinematic  and  attribute  components,  that  are  estimated  using 
Tracking  and  ID  Fusion  functions,  respectively.  A  sequential  process  for  sensor  fusion 
whereby  a  tracking  function  is  followed  by  ID  fusion  as  shown  in  Figure  1  is  considered. 
Data  from  each  of  the  sensors  is  initially  processed  by  a  tracking  function  which  performs 
data  association  to  determine  the  sensor  data  that  corresponds  to  the  same  target.  Typ¬ 
ically,  a  kinematic  target  state  estimate  is  computed  by  the  tracking  function  based  on 
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kinematic  components  of  the  sensor  data. 


The  focus  of  this  paper  is  on  ID  fusion  which  has  the  goal  of  fusing  ID  data  from  multiple 
sensors  to  estimate  the  ID  of  targets.  In  the  DST  framework  an  ID  Combination  function 
processes  the  target  ID  evidence  and  obtains  a  belief  or  mass  distribution.  A  Probability 
Transformation  is  then  performed  to  determine  a  discrete  probability  distribution  of  the 
target  ID.  The  output  of  Sensor  Fusion  is  a  track  corresponding  to  each  target  with 
estimates  of  the  kinematic  state  and  of  the  target  ID. 


Sensor  fusion  supports  decision  making  by  processing  uncertain  sensor  data  to  arrive  at 
target  state  estimates.  If  a  track  on  a  target  corresponds  to  a  large  freighter,  then  the 
decisions  of  an  operator  may  be  quite  different  than  if  the  track  corresponds  to  a  small, 
fast  vessel.  Furthermore,  a  decision  maker  may  be  informed  by  the  uncertainty  in  the 
target  ID.  For  example,  if  it  is  equally  likely  that  the  target  is  a  large  freighter  or  a 
small,  fast  vessel  then  a  decision  maker  might  choose  to  make  a  conservative  decision  that 
is  appropriate  to  their  mission.  Another  course  of  action  might  be  for  the  user  to  task 
sensors  in  an  attempt  to  reduce  the  uncertainty  in  the  estimate  of  the  target  ID. 


Figure  1:  Functional  flow  for  Sensor  Fusion 


The  aim  of  this  paper  is  to  describe  the  fusion  of  target  ID  data  within  the  DST  frame¬ 
work.  Dempster’s  rule  of  combination  is  the  most  common  DST  approach  for  fusing  data. 
Alternative  rules  of  combination  include  Smets’  Transferable  Belief  Model  (TBM),  Yager’s 
Rule,  Inagaki’s  Unified  Combination  Rule  (UCR),  and  Dubois  and  Prade’s  rule.  Each  of 
the  different  rules  of  combination  yields  a  distribution  of  belief  or  mass  and  the  distribution 
may  be  different  depending  on  the  approach  adopted.  An  examination  of  different  rules 
of  combination  is  presented  and  their  underlying  assumptions  described.  Simple  examples 
are  used  to  illustrate  how  the  different  rules  of  combination  are  implemented  and  to  high¬ 
light  the  treatment  of  uncertainty  in  the  sensor  data.  However,  typically,  evidence  that  is 
presented  in  the  form  of  a  distribution  of  belief  does  not  directly  enable  decisions  to  be 
made.  Alternative  transformations  from  a  belief  distribution  to  a  probability  distribution 
are  described  and  illustrative  examples  presented. 
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2  Basic  Concepts  of  Dempster-Shafer  Theory 

In  this  section,  some  of  the  basic  concepts  of  DST  are  described.  The  set  of  possible 
target  types  that  might  be  present  is  known  as  the  frame  of  discernment  (FoD).  The 
construction  of  the  FoD  is  outlined  together  with  the  belief  function  assignment,  which 
is  the  designation  of  values  or  beliefs  regarding  target  ID.  Other  issues  like  whether  this 
FoD  should  be  closed  or  open  are  also  discussed. 


2.1  Frame  of  Discernment  (FoD) 

At  the  heart  of  DST  is  the  Frame  of  Discernment  (FoD)  which  is  typically  denoted  by  0£) 
and  is  a  model  that  describes  the  set  of  possible  hypotheses. 

Suppose  that  the  FoD  is 


©D  =  {Gi,02,  ■  ■  ■  ,0k}, 

where  the  9i,l  <  i  <  k,  are  mutually  exclusive  and  contained  in  the  set  labelled  &£>. 

A  Closed  World  Assumption  (CWA)  implies  that  ©d  is  exhaustive,  so  that  at  least  one 
of  the  9i  must  be  true.  If  it  is  known  that  8i  is  not  true,  then  the  CWA  implies  that  the 
conclusion  must  correspond  to  one  of  the  other  k  —  1  possibilities. 


2.2  Expanding  the  Frame  of  Discernment  (FoD) 

It  is  possible  to  extend  the  FoD  by  combining  it  with  another  [28].  For  instance,  if  we 
have  a  frame  that  describes  the  platform  type  so  that 

01  =  {9i,92,  6*3}. 

then  this  FoD  for  platform  type  can  be  combined  with  another  that  describes  the  platform 
allegiance, 

^  =  {</>!)  </>2,  4>3, 

A  fully  expanded  FoD  is  obtained  from  the  cross  product: 

©^^)  =  ©1  X 

{{9i,(f)i),  i9i,(j)2), . . . ,  {93,4)4)}, 

which  contains  12  elements.  However,  in  many  cases  it  may  be  known  with  certainty  that 
(  f) 

not  all  elements  in  ©2  '  will  be  present.  For  instance,  the  true  FoD  could  be  the  subset  of 
5  elements: 

©2  =  {{9l,4)l),  {9i,4)2),  (92,4)2),  (92,4)3),  (92,4)4)}  ■ 

Further,  there  might  be  different  levels  of  resolution  so  that 

03  =  {(9l,4)l,'4)l),  (9l,4)l,'4)2),  (9i,4)2),  (92,4)2),  (02,4)3),  (94,4)4,4)3),  (6'4,04,V’4)}- 
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Figure  2:  The  FoD  ©3  represented  as  an  irregular  tree 


A  depiction  of  this  FoD  in  the  form  of  an  irregular  tree  is  shown  in  Figure  2. 

Turning  to  the  problem  of  ID  fusion,  a  key  question  is:  How  can  masses  from  sensors 
which  have  different  levels  of  resolution  be  fused?  An  answer  to  this  question  lies  in 
forming  appropriate  subsets  of  the  FoD  that  are  applicable  to  02’s  FoD.  For  example,  ©2 
can  be  expressed  as 

©2  =  {((6'i,  01,^1)  U  (6'i,(/>i,U2)),  (6'i,</>2),  {02,4>2),  (02,  h),  {{02,4>4,i^3)  ©  (02,  (/>4,  ^4))} 

SO  that  if  there  is  evidence  for  (0i,  cpi)  then  that  evidence  can  be  applied  to  the  set  union 
(01,  (/>i,  Ui)  U  (01,  </>i,  V’2))  where  Au  B  means  set  of  those  elements  which  are  either  in  A, 
B  or  in  both. 


2.3  Belief  Function  Assignment 

In  DST,  uncertain  evidence  is  encoded  as  a  belief  function  or  sometimes  referred  to  as  a 
potential  function  (this  is  at  times  distinctly  used  to  distinguish  it  from  Bayesian  proba¬ 
bility  theory  [29,  30])  which  can  be  expressed  in  a  variety  of  equivalent  ways.  There  are 
several  ways  of  encoding  this  uncertain  evidence  which  include:  mass,  belief,  plausibility 
and  commonality.  Denoting  the  function  [(j)]m,  [fplb,  [f'jph  etc  can  be  used  to  indicate 
which  representation  of  the  potential  is  being  used. 
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The  most  common  means  of  encoding  is  via  the  basic  belief  assignment  (bba)  which  is  a 
function  m  defined  on  the  power  set  of  the  frame  of  discernment  0^)  as  follows: 

m  :  p(0d)  [0, 1] 

m{X) 

subject  to  the  constraint  that  Y^xaSn  where  Ofc  =  oi  +  02  •  ■  •  +  o,n-  The 

power  set  of  a  set  of  elements  is  simply  the  set  of  all  possible  subsets  of  elements  from  that 
set.  For  instance,  for  a  FoD  0i  =  {01,02,^3}  consisting  of  |0i|  =  3  elements,  the  power 
set  is  simply 

p(0i)  =  2®i 

=  {{0},  {^2},  {03},  (01, 02},  (01, 03},  (02, 03},  (01, 02,  03}}. 

For  a  given  set  A  C  0^),  the  belief  mass  m(A)  represents  the  proportion  of  all  relevant 
and  available  evidence  that  supports  the  claim  that  a  particular  element  of  0£)  belongs  to 
the  set  A,  but  to  no  particular  subset  of  A.  We  remark  that  mathematically,  m  is  akin  to 
a  probability  distribution  when  the  only  non-zero  belief  masses  are  all  assigned  to  subsets 
containing  a  single  element,  also  known  as  a  singleton.  As  already  noted,  no  interpretation 
of  the  belief  masses  as  probabilities  is  made  in  some  interpretations  of  the  DST. 

To  combine  evidence  from  two  independent  belief  functions  represented  as  bbas  mi  and 
m2,  Dempster’s  rule  of  combination  is  typically  used  and  the  resulting  belief  function, 
denoted  by  mi_2  =  mi  ©  m2. 


2.4  Belief,  Plausibility  and  Commonality 

Two  other  useful  encodings  in  one-to-one  correspondence  with  the  basic  belief  assignment 
m  are  the  belief  and  plausibility  functions  which  are  defined  in  the  following  manner: 

bel(A)  =  ^  m{Y)  and  pl(A)  =  ^  m{Y). 

07^ycx  ynx^0 

For  a  given  subset  A,  bel(A)  quantifies  the  extent  to  which  the  evidence  supports  A, 
while  pi  (A)  quantifies  the  extent  to  which  the  evidence  does  not  contradict  A. 

Another  encoding  frequently  used  is  commonality.  It  can  be  defined  as  the  quantity  of 
the  agent’s  belief  which  may  eventually  be  assigned  to  A.  Mathematically,  it  can  be 
represented  as  follows: 


q{X)  =  Y,  m{Y), 

XCY 

where  A  d  B  means  the  set  that  contains  all  those  elements  that  A  and  B  have  in  common. 
Its  real  value,  however,  lies  in  the  fact  that  it  can  lead  to  much  faster  computation  of 
masses  (see  [31])  due  to  the  following  property: 
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q,2{X)  =  qi{X).q2{X), 

where  qi,i  =  1,2  are  the  commonality  values  assigned  to  X  from  two  different  sources  and 
qi2{X)  is  the  fused  commonality  value  of  the  qiS. 

Note  that  DST  can  handle  probabilities  in  the  traditional  manner,  but  this  is  not  a  re¬ 
quirement  in  order  to  use  DST.  However,  a  link  can  be  made  to  standard  probability 
measures  by  noting  that  belief  and  plausibility  may  be  interpreted  as  lower  and  upper 
bounds  on  the  probability,  respectively.  Note  that  in  the  remainder  of  this  tutorial  we 
adopt  the  conventional  terminology  where  the  term  belief  function  is  used  to  refer  to  the 
bba  rather  than  bel(X). 


2.5  Closed  World  Assumption  (CWA) 

Recall  that  the  Closed  World  Assumption  (CWA)  implies  that  the  FoD  is  exhaustive  so 
that  the  solution  must  correspond  to  one  of  the  elements  in  0£).  In  other  words,  there  is 
zero  probability  or  chance  that  the  true  solution  might  lie  outside  of  0£). 

Consider  two  situations  where  a  CWA  might  be  made.  Firstly,  the  FoD  may  be  known 
to  be  complete  so  that  all  possible  solutions  are  listed.  For  target  ID  applications,  such 
an  assumption  means  that  the  sensors  all  have  fully  encoded  knowledge  of  the  possible 
platform  types  and  are  able  to  produce  a  belief  mass  distribution  based  on  a  library  of 
possible  platform  types  for  subsequent  ID  Fusion.  Secondly,  the  FoD  may  be  known  to  be 
incomplete  but  is  treated  as  complete  so  that  the  actual  solution  may  lie  outside  of  the 
FoD.  In  this  situation,  misidentification  of  a  target  may  result  from  evidential  reasoning 
applied  to  a  FoD  that  is  not  exhaustive.  Misidentification  could  lead  to  a  target  allegiance 
being  declared  hostile  when  in  fact  it  should  have  been  declared  friend  leading  to  fratricide 
or  “friendly  fire” .  It  is  clearly  important  to  be  aware  of  the  underlying  assumption  about 
the  FoD. 


2.6  Open  World  Assumption  (OWA) 

Lack  of  complete  knowledge  about  the  complete  or  entire  solution  space  is  known  as  having 
an  0pen  World  Assumption  (0WA).  Under  the  CWA  a  statement  about  a  set  element 
leads  to  inferences  about  the  remaining  set  elements  in  the  FoD.  However,  for  an  OWA, 
no  such  inference  can  be  made.  While  the  CWA  may  lead  to  inference  errors  if  the  frame 
of  discernment  is  incomplete,  the  OWA  places  restrictions  on  the  deductions  that  can  be 
made  from  the  available  information. 

Assuming  a  FoD  0£),  the  OWA  assumption  implies  that  if  an  inference  is  made  about 
01, ... ,  9k-i  then  we  cannot  conclude  anything  about  Ok-  The  true  FoD  is  actually 

&F  =  ©D  U  Qe, 

where  O^;  is  another  frame  of  unknown  size  so  that  0^5  n  Og  =  0.  That  is,  both  frames 
have  distinct  elements  and  there  is  no  “doubling- up”  across  each  FoD. 
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3  Dempster’s  Rule  of  Combination 

3.1  Definition 

To  combine  evidence  from  two  independent  belief  functions  represented  as  bbas  mi  and 
m2,  Dempster’s  rule  of  combination  is  typically  used  and  the  resulting  belief  function, 
denoted  by  mi^2  =  ®  "i2,  is  given  by  the  following  direct  sum: 

(mi  ©m2)(A)  =  — *—  V  mi(y)m2(Z),  (1) 

''  >  Ynz=x 

where  the  conflict  mass  k  is  given  by 

K=  ^  mi(V)m2(Z).  (2) 

Yr\Z=% 

In  Dempster’s  rule  of  combination,  the  empty  set  mass  is  zero,  i.e.  m(0)  =  0  [32].  It  should 
be  noted  that  Dempster’s  rule  is  commutative,  i.e.  mi  ©  m2  =  m2  ©  mi  and  associative, 
i.e.  (mi  ©  m2)  ©  m3  =  mi  ©  (m2  ©  m3),  when  the  basic  belief  masses  are  compatible,  i.e. 
are  not  in  complete  conflict. 


3.2  Conflict 

A  conflict  between  two  beliefs  in  DST  is  where  one  source  strongly  supports  one  hypoth¬ 
esis  and  the  other  strongly  supports  another  hypothesis,  so  that  the  two  hypotheses  are 
not  compatible.  The  conflict  mass  k  quantifies  the  disparity  of  information  between  two 
sources  and  is  non-zero  whenever  there  is  an  incompatibility  in  the  hypotheses  correspond¬ 
ing  to  each  source  [33].  The  closer  that  k  is  to  1,  the  more  we  should  reconsider  how  the 
evidence  influences  the  beliefs  [34].  If  fusing  two  pieces  of  information  (so  that  one  has  a 
belief  in  X  and  the  other  in  X)  result  in  k  =  1  then  it  is  clear  that  this  fusion  does  not 
justify  any  support  for  either  option  (viz  either  X  or  A). 

Zadeh  [35]  showed  that  normalization  in  Dempster’s  rule  would  apparently  resolve  conflict 
but  might  result  in  misleading  final  values,  whilst  [36]  describes  a  novel  technique  for 
discounting  evidence  in  proportion  to  the  degree  of  conflict  it  provides.  An  example 
was  presented  where  two  medical  doctors  {hi  and  /i2)  inspected  a  patient  and  provided 
diagnoses  for  meningitis  (M),  concussion  (C)  and  tumor  (T).  Their  advice  is  listed  in 
Table  1. 

Table  1:  Medical  Diagnoses  of  Two  Doctors 


Medical 

Diagnosis 

Doctor 

M 

C 

T 

hi 

0.99 

0.00 

0.01 

h2 

0.00 

0.99 

0.01 

In  this  example  when  all  masses  are  combined  according  to  Dempster’s  rule,  we  have  the 
counter-intuitive  result  that  the  fused  masses  give  a  value  of  1  for  tumor.  This  result 
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arises  because  the  only  common  non-zero  masses  are  those  that  correspond  to  tumor. 
Other  difficulties  with  DST  have  been  discussed  by  Dezert  et.al.  [37,  38,  39].  Bayesian 
fusion  techniques  give  a  similar  result  to  DST  approaches  when  prior  information  is  not 
carefully  defined  [6]. 


4  Alternative  Rules  of  Combination 


Dempster’s  Rule  of  Combination  is  the  most  common  DST  approach  for  fusing  information 
from  multiple  sources.  In  this  section  we  describe  the  following  approaches  in  which 
evidence  from  multiple  sources  is  fused: 


•  Smets’  Transferable  Belief  Model  (TBM) 

•  Yager’s  rule 

•  Inagaki’s  Unified  Combination  Rule  (UCR) 

•  Dubois  and  Trade’s  rule 


Note  that  these  are  probably  the  most  well-known  transformations  and  each  make  subtle 
assumptions  about  the  nature  of  the  FoD  as  will  be  discussed  shortly.  Furthermore,  a 
different  approach  for  treating  evidence  implies  that  there  is  a  different  approach  for  the 
management  of  conflicting  information  in  the  source  data.  The  management  of  conflict  is 
described  for  each  of  the  different  rules  of  combination. 


4.1  Smets’  Transferable  Belief  Model 

The  most  well-known  rule  for  explicitly  accounting  for  conflict  is  the  Transferable  Belief 
Model  (TBM)  [40].  In  this  case,  the  combination  rule  is  given  by 

m^(A)  =  (mi  ©  m2)(Y)  =  ^  mi{Y)m2{Z),  (3) 

Yr\Z=x 


where  the  mass  for  the  empty  set  m(0)  >  0  in  contrast  to  Dempster’s  rule  of  combination 
and  many  other  rules  of  combination  where  m(0)  =  0. 

The  effect  of  normalisation  with  the  conflict  mass  k  in  Dempster’s  rule  of  combination 
is  that  a  closed  world  is  assumed.  However,  by  allowing  the  possibility  of  the  empty  set 
mass  to  be  non-zero  in  Smets’  TBM,  this  implies  that  an  open  world  is  assumed.  The 
TBM  is  commutative  and  associative,  just  like  Dempster’s  rule  of  combination.  It  is  worth 
noting  that  Smets  has  co-authored  a  number  of  papers  regarding  the  application  of  TBM 
to  target  detection  [41,  42,  43,  44,  45,  46,  47,  48]. 
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4.2  Yager’s  Rule 


Yager’s  rule  of  combination  is  similar  to  Shafer  and  Smets’  TBM  but  it  differs  in  the  treat¬ 
ment  of  mass  that  is  assigned  to  the  empty  set  [49].  Yager  defines  the  ground  probability 
assignment  associated  with  X  as 

r{X)  =  ^  mi{Y)m2{Z). 

Yr\Z=x 

The  bba  arising  from  Yager’s  rule  of  combination  is  given  by 

m^iX)  =  r{X)  (4) 

^^(©d)  =  r(0i5) r(0)  (5) 

so  that  any  conflict  n  =  r(0)  is  assigned  to  the  FoD  Qjo.  That  is,  Yager’s  rule  implies  a 
CWA  because  the  possible  solutions  are  contained  in  the  FoD  0£). 


We  can  relate  Yager’s  rule  to  Dempster’s  rule  in  the  following  manner 


m'^(0) 

md^{X) 


^(Qp) 

1  —  r(0) 
r{X) 

1  -  r(0)  ’ 


Note  that  Yager’s  rule  is  commutative  but  not  associative.  Rather,  Yager’s  rule  is  con¬ 
sidered  to  have  a  “quasi-associative”  property,  which  means  that  the  operator  can  be 
broken  down  into  associative  suboperations  [32].  Yager  justified  this  property  on  the  basis 
that  sometimes  the  order  of  operation  does  matter;  in  particular  when  new  information  is 
available  and  this  is  incorporated  into  an  already  combined  structure  [49]. 


4.3  Inagaki’s  Unified  Combination  Rule 


Inagaki’s  UCR  is  a  parametrized  class  of  combination  operations  that  incorporates  both 
Dempster’  rule  and  Yager’s  rule.  Specifically,  Inagaki  [50]  argues  that  every  combination 
rule  can  be  expressed  in  the  form: 


m*(A)  =r(A)  +  /(A)r(0), 
where  A  /  0,  with  m*(0)  =  0,  and 

^  f{X)  =  1,  /(A)  >  0. 

XCp(0b),X^0 


From  these  equations  Inagaki’s  unified  combination  rule  can  be  derived  so  that: 

m^(A)  =  [1 -I- ar(0)]g(A),  where  A  7^  p(0£)),0 

=  [1 ag(0)]g(0p) [1 ar(0)  -  a]r(0) 

^  “  1  -  r(0)  -  r{QD) 


(6) 

(7) 

(8) 


If  a  =  0  then  the  results  are  equivalent  to  Yager’s  rule  and  if  a  =  1/[1  —  r(0)],  then 
the  masses  are  equivalent  to  those  obtained  using  Dempster’s  rule.  The  Inagaki’s  UCR  is 
commutative,  but  quasi-associative. 
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4.4  Dubois  and  Prade’s  Rule 

Dubois  and  Prade  [51]  took  a  set-theoretic  approach  in  arriving  at  their  rule 

(mi  ©  m2)‘^^(X)  =  ^  mi{Y)m2{Z).  (9) 

YUZ=X 


The  use  of  the  set  union  in  Dubois  and  Prade’s  rule  avoids  consideration  of  the  conflict 
mass  and  so  there  is  no  requirement  for  normalisation.  The  Dubois  and  Prade  rule  of 
combination  is  commutative  and  associative. 


4.5  Other  Rules  of  Combination 

There  are  other  rules  of  combination  which  have  been  developed  within  the  DST  frame¬ 
work  such  as  Zhang’s  Center  Combination  Rule  (CCR)  [52],  Mixing  or  Averaging  [53],  the 
Adaptive  Combination  Rule  (ACR)  [56],  Convolutive  X- Averaging  [53],  the  Cumulatative 
rule  [54],  the  Cautious  rule  [55],  the  collection  of  Proportion  Conflict  Resolution  (PCR) 
rules  [37,  56]  and  Zhang’s  Center  Combination  Rule  (CCR)  is  the  same  as  Dempster’s 
for  some  cases  and  scaling  problems  may  arise  in  some  instances  ([32]).  The  Mixing  or 
Averaging,  the  Cumulative  and  the  ACRs  rely  on  weights  being  assigned  to  the  mass  dis¬ 
tributions,  whilst  the  Convolutive  X- Averaging  deals  with  information  given  as  intervals. 
The  Cautious  rule  assumes  dependency  among  the  evidence  sources  which  is  not  consid¬ 
ered  here,  while  the  PCR  set  of  rules  are  5  rules  proposed  use  a  variant  or  variants  of  the 
conjunctive  rule  to  redistribute  the  conflicting  mass  on  non-empty  sets  according  to  in¬ 
tegrity  constraints.  Finally,  Dubois  and  Prade  [57]  proposed  another  rule  which  combines 
both  their  original  disjunctive  rule  with  Smets’  TBM.  These  are  not  described  further  in 
this  paper  but  thorough  reviews  of  various  DST  rules  of  combination  have  been  provided 
elsewhere  such  as  in  [56],  [58]  and  especially  [32].  Lastly,  most  of  these  rules  are  also  more 
thoroughly  reviewed  by  Nguyen  and  Docking  ([59]). 


5  Probability  Transformations 

The  bba  distribution  of  the  DST  framework  does  not  readily  support  decisions  to  be 
undertaken.  Transformations  are  required  to  convert  from  the  bba  function  to  a  probability 
distribution,  however  other  approaches  to  decision-making  not  involving  transformations 
have  also  been  proposed  [12,  15,  16].  This  ID  fusion  approach  where  ID  combination  is 
followed  by  a  probability  transformation  corresponds  to  two  levels  of  reasoning  within  the 
DST  framework.  At  the  credal  level,  evidence  is  represented  and  manipulated  using  rules 
of  combination,  while  at  the  pignistic  level,  evidence  is  manipulated  into  a  probability 
distribution  to  support  decision-making. 

In  a  similar  manner  to  the  rules  of  combination,  there  are  several  probability  transforma¬ 
tions  within  the  DST  framework.  Four  different  probability  transformations  are  described 
here: 


•  Aggregate  Uncertainty  Approach  (AU) 
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•  Plausibility  Transformation  (PIT) 

•  Pignistic  Transformation  (PiT) 

•  Generalised  Pignistic  Transformation  (GPT) 


Each  of  the  probability  transformations  result  in  a  distribution  of  probability  P{X)  over 
singleton  elements,  where  \X\  =  1,  of  the  FoD  Qq.  A  decision  may  then  be  made  by,  for 
example,  declaring  the  target  type  that  is  most  highly  supported  by  the  amassed  evidence 
to  be  Afc,  where  ,  P{Xi). 

The  probability  transformations  listed  above  are  among  the  most  popular  and  they  obey 
the  minimal  criterion  which  essentially  states  that  the  result  of  applying  the  transform 
to  the  vacuous  belief  function  should  be  the  uniform  probability  distribution  [60].  A 
vacuous  belief  function  is  where  the  ignorance  mass,  or  the  mass  over  the  entire  FoD  is  1. 
Other  probability  transformations  not  considered  here  include  those  of  Sudano  [61]  and 
Guzzolin  [62], 


5.1  Aggregate  Uncertainty  Approach 

The  Aggregate  Uncertainty  (AU)  approach  method  distributes  the  masses  of  non-singleton 
elements  to  the  constituent  singletons  and  then  normalises  to  arrive  at  a  probability 
mass  [63].  For  instance,  the  mass  m(Y  U  Z)  is  distributed  to  m{Y)  and  m{Z),  where 
|y|  =  \Z\  =  1.  The  AU  method  selects  the  proportion  of  m(V  U  Z)  distributed  to  each  of 
the  singletons  in  such  a  way  as  to  minimise  the  entropy  of  the  resulting  probability  distri¬ 
bution  [8].  Again,  for  a  binary  frame  (i.e.  Two  elements),  this  process  is  straightforward: 
if  m(X)  >  0.5,  the  transform  yields  Pi{X)  =  m{X)  and  Pi{Y)  =  m{Y)  +  m{X  U  U), 
otherwise  Pi{X)  =  Pi{Y)  =  0.5. 


5.2  Plausibility  Transformation 

Also  known  as  the  Bayesian  approximation  method, the  Plausibility  Transformation  (PIT) 
has  been  argued  that  this  transformation  is  the  most  consistent  technique  for  translating 
Dempster’s  rule  of  combination  to  the  Bayesian  probability  domain  [64].  For  a  binary 
frame,  if  A  E  ©£>,  where  a  £  A  indicates  that  the  element  a  belongs  to  the  set  A,  the 
probabilities  given  by  this  transformation  become 

P2{X)  =  K-^pl{X), 


where  K  =  Eyeeo  pK^)- 

5.3  Pignistic  Transformation 

The  Pignistic  Transformation  (PiT)  [40]  moves  the  belief  mass  from  the  union  elements 
of  the  power  set  and  distributes  it  equally  amongst  the  corresponding  singleton  members. 
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For  an  element  X  G  &£>,  the  PIT  gives 


P3{X) 


ih^YQp{@o)-X& 


m{Y) 


(10) 


where  k  is  defined  as  before. 

An  OWA  is  made  under  TBM.  However,  once  the  pignistic  transform  is  required  to  make 
a  decision  it  is  then  really  apportioning  masses  to  the  singletons  as  though  a  CWA  is  being 
made. 


5.4  Generalized  Pignistic  Transformation 


The  Generalized  Pignistic  Transformation  (GPT)  aims  to  maximise  the  Probabilistic  In¬ 
formation  Content  (PIC).  Here  it  suffices  to  say  that  maximization  of  the  PIC  is  equivalent 
to  minimization  of  Shannon’s  entropy  [37].  For  an  element  X  ^  Qd,  the  transformation 
is  given  by 


Pi{X)  =  m{X)  +  {m{X)  +  e) 


E 

Y 

XCY,\Y\>2 


m{Y) 

y~]  ze«i(e0)  rn{Z)  -|-  e\Y 

ZCV,|Z|=1 


(11) 


for  some  small  arbitrary  e  >  0. 


6  Example  Sensor  Data 

In  this  section  we  describe  several  different  cases  that  are  used  to  illustrate  the  application 
of  DST  to  target  ID. 

First,  we  consider  a  FoD  that  describes  allegiance  so  that 

©a  =  {F,77,A}, 

where  F,  H  and  N  represent  friendly,  hostile  or  neutral  allegiance,  respectively.  Further,  for 
each  allegiance  type,  there  are  certain  associated  platform  types.  This  can  be  represented 
by  another  FoD: 


ep  =  {A,B,C,D,E}. 

Next,  combining  these  frames  of  discernment  a  fully  expanded  FoD  with  15  platform  types 
(that  include  allegiance)  is  obtained.  These  are: 

=  {FA,FB,...,NE}, 

where  the  first  letter  represents  the  allegiance  and  the  second  indicates  the  platform  type. 
In  addition,  if  it  is  known  with  certainty  that  the  true  target  ID  does  not  correspond  to 
certain  elements  within  the  FoD,  then  these  may  be  pruned  from  further  consideration  so 
that  the  FoD  is  reduced  to  a  smaller  set  of  elements,  as  in  the  example  of  ©d  below: 
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0^  =  {fa,  FC,  FD,  HB,  HC,  NA,  NC,  NE], 

Figure  3  shows  the  FoD  for  target  ID  depicted  as  a  tree  of  two  layers  with  allegiance  and 
platform  type,  respectively.  Reducing  the  FoD  in  this  manner  is  one  option  to  reduce  the 
computation  complexity.  However,  from  a  modelling  perspective,  preserving  the  original 
frame  with  15  elements  might  be  preferable  in  case  there  is  evidence  that  could  be  used  to 
break  the  deadlock.  Also,  from  the  point  of  view  of  maintaining  the  underlying  database 
or  ontology,  this  might  provide  a  cleaner  solution. 


Allegiance 

F  H  N 


Platform  Type 


Figure  3:  A  tree  representing  the  FoD  for  the  example  listed  above 

As  an  aside,  on  occasions,  platform  types  may  be  combined  so  that  FAC  may  be  used 
in  the  instance  where  sensors  may  not  be  able  to  discern  between  platform  types  FA 
and  FC.  In  a  similar  manner,  other  classification  labels  could  be  added  to  include  target 
environment  such  as  air,  surface  and  land. 

We  assume  that  there  is  a  single  target  of  interest  and  that  sensor  data  relating  to  the 
target  ID  is  to  be  processed.  Five  different  cases  are  considered  in  which  uncertain  data 
about  the  target  ID  is  received  from  two  or  more  sensors.  Table  2  shows  the  example 
sensor  input  data  for  ID  fusion.  There  are  two  sensors  under  consideration  for  cases  1  to  4 
while  for  case  5  there  are  three  sensors.  The  sensor  input  data  is  represented  as  a  discrete 
mass  distribution  over  the  FoD  where  the  sum  of  the  masses  for  each  sensor  is  unity.  A 
blank  entry  in  Table  2  implies  that  the  corresponding  mass  is  zero. 
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Table  2:  Example  sensor  input  data  for  ID  fusion 


Case 

Sensor 

Target  ID 

FA 

HB 

NA 

AC 

FAUNA 

F 

H 

A 

©D 

51 

0.7 

0.2 

0.1 

i 

52 

0.6 

0.2 

0.2 

O 

51 

0.7 

0.2 

0.1 

Z 

52 

0.4 

0.6 

o 

51 

0.7 

0.2 

0.1 

o 

52 

0.6 

0.2 

0.2 

A 

51 

0.7 

0.2 

0.1 

52 

0.6 

0.4 

51 

0.7 

0.2 

0.1 

5 

52 

0.6 

0.2 

0.2 

53 

0.6 

0.4 

In  case  1,  the  two  sensors  provide  belief  masses  for  singleton  elements.  Sensor  51  provides 
non-zero  masses  for  the  elements  FA,  HB  and  NA  while  the  data  for  sensor  S2  differs  in 
the  values  for  FA,  HB  and  the  mass  for  A  A  is  zero  while  the  mass  for  AC  is  non-zero. 

In  case  2,  sensor  51  provides  the  same  mass  distribution  as  for  case  1.  However  sensor  52 
provides  non-zero  mass  for  element  HB  and  for  the  union  (FAUNA).  In  other  words, 
sensor  52  is  able  to  discern  that  the  platform  type  is  either  HB  or  one  of  FA  or  NA, 
without  being  more  specific  about  the  latter  two  types. 

Some  sensors  may  only  be  able  to  discern  data  at  the  allegiance  level.  In  case  3  sensor 
51  provides  the  same  mass  distribution  as  for  cases  1  and  2,  but  sensor  52  provides  data 
at  the  allegiance  level  only  so  the  mass  distribution  is  for  the  elements  of  F,  H  and  A. 
The  allegiance  elements  may  be  expanded  in  terms  of  their  constituent  platform  types,  ie. 
F  =  {FA  UFCU  FD},  H  =  {HB  U  HC]  and  A  =  {NA  U  AC  U  NE}. 

In  case  4  each  sensor  attributes  some  of  the  mass  to  elements  of  the  FoD  and  the  remaining 
mass  is  attributed  to  the  FoD  Qd.  This  is  a  means  of  handling  missing  information. 

In  case  5  there  are  three  sensors.  Sensors  51  and  52  provide  mass  distributions  for 
singletons  while  sensor  53  provides  mass  for  NA  and  Qd- 


7  Results 


In  this  section  we  present  the  results  of  applying  ID  fusion  to  the  example  sensor  data 
described  in  the  preceding  section.  We  consider  ID  combination  first  where  the  DST  rules 
of  combination  are  applied  to  obtain  a  combined  bba  from  multiple  bbas  provided  by  each 
sensor.  Then,  a  probability  transformation  is  applied  to  transform  the  combined  bba  into 
a  probability  distribution. 
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7.1  ID  Combination  results 

For  the  purposes  of  illustrating  how  the  ID  combination  rules  are  applied,  we  adopt  a 
tabular  representation  similar  to  that  of  Sentz  et  al  [32],  as  shown  in  Table  3.  The 
following  steps  show  how  Dempster’s  rule  of  combination  in  equations  (1)  and  (2)  may  be 
implemented  for  the  example  data  corresponding  to  case  2.  The  fused  mass  is  assigned  to 
the  intersection  of  the  elements  of  both  masses  being  fused.  The  second  line  in  each  entry 
in  Table  3  indicates  the  resulting  element  to  which  the  fused  mass  is  assigned. 

1.  Compute  the  combined  mass  for  an  element  as  the  product  of  the  masses  from  the 
corresponding  row  and  column.  For  instance, 

mi{F  A)m2{H  B)  =  0.28, 

contributes  to  k  since  {FA}  n{i?F}  =  0.  Or 

mi{F A)m2{F AU  N A)  =  0.42, 

contributes  to  the  mi  0  m2{F A)  since  {FA}  n  {FA  U  NA}  =  {FA}. 

2.  Conflicting  evidence  arises  when  the  intersection  is  empty,  eg.  F Ar\  HB  =  ^.  The 
product  of  the  masses  from  the  corresponding  row  and  column  contributes  to  the 
summation  for  the  conflict  k  in  equation  2. 

3.  Sum  the  masses  for  the  conflict: 

K  =  ^  mi{Y)m2{Z) 

Ynz=0 

=  mi{F A)m2{H B)  +  mi{N A)m2{H B)  0  mi{H B)m2{F A  U  NA) 

=  0.28  +  0.04  0  0.12  =  0.44 

4.  Where  the  intersection  is  not  empty,  this  represents  supporting  evidence  for  the  set 
intersection.  The  product  of  the  masses  from  the  corresponding  row  and  column 
contributes  to  the  summation  for  the  mass  of  the  relevant  set  in  equation  1,  eg. 
{FA}  n  {FA  U  NA}  =  {FA}. 

5.  Sum  the  masses  for  each  element,  eg: 

(mi0m2)(FA)  = 


The  results  for  ID  combination  using  the  different  DST  rules  of  combination  for  each  of 
the  5  cases  are  presented  in  Table  4  with  the  notation  =  m{Y  U  Z).  The  value  of 

the  parameter  a  in  Inagaki’s  Unified  Combination  Rule  was  selected  to  be  1/[2(1  —  r(0)], 
which  is  the  halfway  point  between  Yager’s  rule  and  Dempster’s  rule. 


(1  -  k) 

1 

(1-0.44) 
0.42 


^  mi(Y)m2(^), 


Ynz=FA 


mi{F A)m2{F A  U  N A) 


0.56 


=  0.75 
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Table  3:  Example  of  Dempster’s  rule  of  combination  applied  to  case  2 


Sensor  2 

ID 

HB 

FAUNA 

mi 

0.4 

0.6 

Sensor  1 

ID 

m2 

FA 

0.7 

mi{F  A)m2{H  B)  =  0.28, 
FAr^HB  =  {D 

mi{F A)m2{F A  U  N A)  =  0.42, 
FA  n  {FA  U  NA)  =  FA 

HB 

0.2 

mi{H  B)m2{H  B)  =  0.08, 
HBnHB  =  HB 

mi{H B)m2{F A  U  NA)  =  0.12, 
HB  n  {FA  U  NA)  =  0 

NA 

0.1 

mi{N  A)m2{H  B)  =  0.04, 
NAnHB  =  $ 

mi{N A)m2{FA  U  NA)  =  0.06, 
NA  n  {FA  U  NA)  =  NA 

It  can  be  seen  in  Table  4  that  the  belief  masses  arising  from  each  of  the  ID  combination  rules 
sum  to  one,  as  required.  A  coarse  inspection  reveals  that  Dubois  and  Prade’s  disjunctive 
rule  typically  yields  a  belief  mass  distribution  for  a  greater  number  of  elements  than  the 
other  rules  of  combination.  On  the  other  hand,  Dempster’s  rule  usually  produces  non-zero 
belief  masses  for  the  least  number  of  set  elements  because  it  generally  does  not  assign 
mass  to  the  empty  set,  0  or  to  the  FoD  ©£>,  like  Dubois  and  Prade’s  rule.  The  exception 
to  this  is  in  case  4  when  Qjo  appears  in  the  sensor  input  data. 


7.2  Probability  Transformation  results 

Probability  transformations  are  applied  to  compute  probability  values  from  the  belief  mass 
distributions  obtained  from  the  ID  combination  function.  The  belief  masses  obtained  by 
applying  the  Dubois  and  Prade  rule  of  combination  to  case  2  are  used  to  illustrate  the 
different  probability  transformations.  The  reason  for  selecting  the  results  of  the  Dubois 
and  Prade  rule  of  combination  is  simply  that  the  corresponding  bba  for  case  2  has  nonzero 
masses  for  elements  with  cardinality  of  one,  two  and  three.  In  Table  5,  Pi,P2,  P3  and  P4 
denote  the  AU,  the  PiT,  the  PIT  and  the  GPT  respectively  as  applied  to  the  bba  arising 
from  the  Dubois  and  Prade  rule  of  combination  for  case  2.  For  the  GPT  the  value  of  e 
was  set  to  0.001. 


8  Discussion 

Several  interesting  features  of  ID  fusion  using  DST  can  be  observed  from  the  results 
tabulated  in  Tables  4  and  5.  A  comparison  of  the  results  in  Table  4  for  Smets’  and 
Yager’s  rules  shows  that  the  belief  mass  distribution  are  usually  similar  in  that  conflict  is 
distributed  to  the  empty  set  mass  in  the  former  and  to  the  ignorance  or  universal  mass 
in  the  latter.  Dubois  and  Prade’s  rule  yields  similar  results  to  Yager’s  rule.  This  is  not 
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Case 

Rule  of  Combination 

Dempster 

Smets 

Yager 

Inagaki 

Dubois  &  Prade 

1 

0.91^^'^^ 

0.09(^^) 

0.42CA 

0.04(^^^) 

0.54(®) 

0.42C'^1 

0.04(^^) 

0.54(®o) 

0.67C'^) 

0.06(^^) 

0.27(®o) 

0.42C'^1 

0_26(^a,hb) 

0.04(^«) 

0.06(^AiVA) 

qq2{HB,NA) 

Q_Q^iHB,NC} 

Q_Q4(7VA,iVC) 

0_i4(FA,ivc) 

2 

0.75(^^) 

0.42l^^) 

0.08(^^) 

0.44(®) 

0.42(^^1 

0.08^^^) 

0.44(®o) 

0.59(^^) 

0.08(^^) 

0.22(®o) 

0_04(^^bNA) 

0.28(-^^’^^) 

0.08(^®) 

0_48(i’’A,iVA) 
g  42('^A,HB,AfA) 

3 

0.88^^'^^ 

0.08(^^) 

0.04(^^) 

0.42CA 

0.04(^^) 

0.02(^^) 

0.52(®) 

0.42C'^1 

0.04(^^) 

0.02(^^) 

0.52(®o) 

0.65C'^) 

0.03(^^) 

0.26(®o) 

q_q^{HB,HC) 
q_qq{NA,FA,FC,FD) 
q_q^{HB,NA,NC,NE) 
q_q2‘^NA,HB,HC) 
g  42('f^^FA,FC,FD) 

0  14('^A,ArA,iVC,AfE) 
Q_l^iFA,HB,HC) 
g  g2(AfA,AfC,Ar£;) 

4 

0.09(^^) 

QQfiFA,NA) 

0.05(®o) 

0.08^^^) 

q,qq(FA,NA) 

0.12(®) 

0.04(®o) 

0.70^^^) 

0.08(^^) 

0.16(®o) 

0.75(^^) 

0.09^^^) 

o.io(®o) 

g_42(FA,WA) 

q_12(^A,HB,NA) 

0.46(®^) 

5 

0.91^^'^^ 

0.09(^^) 

O.I7CA 

0.02(^^) 

0.82(®) 

~oaW^ 

0.02(^^) 

0  32(iVA 

0.49(®^) 

0.37C'^) 

0.04(^^) 

Q_23(AfA) 

0.37(®n) 

0_29(i’’A,WA) 

g_gg(FA,iVA,7Vi?) 

0  q2'^hb,na,ne) 

QQfiNA,NE) 

g_g4(//S,iVA) 

0.40(®o) 

Table  4-  Belief  mass  distributions  obtained  using  different  DST  combination  rules  for  five 
cases  of  sensor  data  input. 


surprising  given  that  they  both  assume  a  closed  world.  However,  whilst  Yager’s  rule  assigns 
any  conflict  to  the  universal  mass,  Dubois  and  Prade’s  rule  assigns  its  conflict  to  the  union 
of  the  elements  of  the  fused  masses. 
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Table  5:  Results  using  the  different  probability  transformations  applied  to  the  belief  mass 
distribution  obtained  from  Dubois  and  Trade’s  rule  of  combination  for  case  2. 


ID 

Probability  Transformation 

Aggregate 
Uncertainty,  PI 

Plausibility 
Transformation,  P2 

Pignistic 

Transformation,  P3 

Generalized  Pignistic 
Transformation,  P4 

FA 

0.33 

0.43 

0.42 

0.25 

HB 

0.33 

0.26 

0.28 

0.51 

NA 

0.33 

0.31 

0.3 

0.24 

But,  as  can  be  observed  from  the  last  example,  this  method  yields  the  trivial  result  when 
the  output  from  one  source  is  completely  ignorant.  It  is  interesting  to  note  for  all  the  rules, 
except  Dubois  and  Prade’s,  if  even  one  sensor  does  not  contain  a  value  for  an  element  of 
the  power  set  (say  hi),  but  all  the  other  sensors  or  sources  do,  then  when  fused  no  mass 
will  be  allocated  to  hi.  This  is  in  effect  what  occurred  with  Zadeh’s  example;  even  though 
a  doctor  gave  a  large  confidence  value  for  a  particular  diagnosis,  a  value  of  zero  from  the 
other  doctor  effectively  “overpowers”  that  diagnosis.  Indeed,  a  value  of  zero  or  one,  is 
very  profound  and  is  something  to  bear  in  mind  when  using  the  aforementioned  rules. 
Hence,  it  is  better  to  assign  at  least  some  amount  of  non-zero  mass  to  the  FoD  to  offset 
this  phenomenon.  This  is  analogous  to  the  rule  of  practice  of  using  a  non-zero  likelihood 
function  in  Bayesian  reasoning.  The  other  interesting  example  is  case  5  using  Dubois  and 
Prade’s  rule.  Here,  we  see  that  even  though  sensor  3  does  not  in  effect  add  or  take  away 
any  information,  fusing  it  with  the  other  2  sensors  renders  maximum  ignorance.  In  this 
case,  the  ignorance  of  this  sensor  overpowers  any  other  information  provided  by  the  other 
two. 

The  results  for  the  probability  transformations  in  Table  5  demonstrate  that  the  AU  rule 
and  the  GPT  have  quite  different  outcomes.  The  AU  rule  aims  to  maximise  the  entropy, 
whilst  the  GPT  aims  to  minimise  it.  In  terms  of  uncertainty,  the  AU  transformation  is  the 
most  pessimistic,  while  the  GPT  is  the  most  optimistic.  The  PiT  and  PIT  sit  somewhere 
in  between.  Advocates  of  the  PiT  claim  that  it  takes  the  most  prudent  betting  criterion. 
Advocates  of  the  PIT  claim  that  it  most  closely  resembles  Bayesian  probability. 

Typically  the  platform  type  with  the  largest  probability  might  be  reported  to  a  user,  how¬ 
ever  for  AU  this  is  not  possible.  Note  that  the  platform  type  with  the  highest  probability 
for  P4  differs  from  that  of  P2  and  P3.  The  occurrence  of  different  decisions  suggests  that 
careful  selection  of  the  probability  transform  is  needed.  In  many  target  ID  applications, 
exceedance  of  a  specified  threshold  is  required  before  any  decision  on  the  platform  type 
is  made.  Further,  if  a  decision  is  made  so  that  the  declared  platform  type  is  that  which 
is  given  by  the  maximum  probability  then  just  from  this  example  above,  this  will  vary 
according  to  the  probability  transformation.  For  the  AU,  all  platform  types  are  equally 
probable  and  it  would  be  interesting  to  find  out  if  this  rather  dramatic  result  is  due  to  the 
AU  transformation  itself  or  rather  as  a  result  of  Dubois  and  Prade’s  transformation.  For 
the  PIT  and  the  PiT,  FA  is  the  most  likely  platform  type,  whilst  for  the  GPT,  HB  is  the 
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most  likely  platform  type.  As  an  example,  if  a  threshold  of  0.5  was  applied  for  a  decision, 
then  only  the  GPT  would  yield  a  decision  for  the  platform  type,  even  though  out  of  all 
the  transformations,  this  is  the  only  one  that  would  conclude  that  HB  was  the  most  likely 
target. 


9  Conclusion 

In  this  paper,  an  overview  of  DST  and  its  application  to  the  problem  of  ID  fusion  using 
data  from  multiple  sensors  have  been  presented.  Several  examples  have  been  provided 
to  demonstrate  various  rules  of  combination  and  probability  transformations  in  the  DST 
framework.  Underlying  assumptions  for  open  and  closed  worlds  are  paramount  in  guiding 
the  selection  of  the  most  appropiate  ID  combination  rule.  With  this  in  mind,  further  work 
will  concentrate  on  the  significance  of  assuming  an  open  versus  a  closed  world  assumption 
with  regards  to  target  identification  and  classification  and,  by  implication,  the  different 
ways  in  which  conflicting  reports  are  resolved  at  a  decision-making  level.  In  addition,  an 
understanding  on  when  perhaps  one  rule  might  be  preferred  in  this  type  of  application 
will  also  be  the  subject  of  future  study,  as  is  the  investigation  of  how  evidence  received  at 
different  time  instants  may  be  updated  in  the  DST  framework.  Future  work  will  focus  on 
comparing  DST  to  Bayesian  fusion  for  target  ID. 
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